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We present a randomized distributed approximation algoritiim for tiie metric unca- 
pacitated facility location problem. The algorithm is executed on a bipartite graph in 
the CONGEST model yielding a (1.861 + e) approximation factor, where e is an arbi- 
trary small positive constant. It needs 0(«^/'*logf_|_g(«)) communication rounds with 
high probability {n denoting the number of facilities and clients). To the best of our 
knowledge, our algorithm currently has the best approximation factor for the facility 
location problem in a distributed setting. It is based on a greedy sequential approx- 
imation algorithm by Jain et al. (J. ACM 50(6), pages: 795-824, 2003). The main 
difficulty in executing this sequential algorithm lies in dealing with situations, where 
multiple facilities are eligible for opening, but (in order to preserve the approximation 
factor of the sequential algorithm) only a subset of them can actually be opened. Note 
that while the presented runtime bound of our algorithm is "with high probability", the 
approximation factor is not "in expectation" but always guaranteed to be (1.861 + e). 
Thus, our main contribution is a sublinear time selection mechanism that, while in- 
creasing the approximation factor by an arbitrary small additive term, allows us to 
decide which of the eligible facilities to open. 
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Teams" within the SPP 1 183 "Organic Computing", and the Paderbom Institute for Scientic Computation (PaSCo). 



1. Introduction 



Facility location is one of the most studied optimization problems in operations research and cap- 
tures a large variety of applications. A classical motivation is placing facilities (e.g., warehouses) 
in such a way that the combined costs of customer satisfaction and warehouse construction are 
minimized. However, there are also plenty of applications in distributed scenarios. For instance, 
in wireless networks a set of nodes has to be chosen to provide some services (e.g., a distributed 
database). Making such services available incurs costs at those facility nodes, while all remaining 
nodes act as clients. They use the services of the nearest faciUty node, and are charged a cost pro- 
portional to the corresponding distance. The objective is to determine a set of faciUty nodes such 
that the costs caused by the facihties and the clients is as low as possible. 



Formal problem definition. We consider the metric uncapacitated Facility Location problem 
in a distributed setting. Here, we are given a complete bipartite graph G = F U C consisting of 

a set of facilities F and a set of clients C. To each facility i € F an opening cost f G ]R>o is 
assigned. Each edge {/, j} in G is weighted with the value Cij G M>o that represents the costs of 
connecting client j with facility i. The objective is to determine a subset of the facilities to be 
opened and connect every client to at least one open facility in such a way that the sum of the 
opening costs and connection costs is minimized. The linear program representation of the facility 
location problem and its dual program are as follows: 
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The variable indicates whether facility / is open (y, = 1) or closed (>', = 0). The other indicator 
variable Xij has the value 1 if the client j is connected to facility /, and otherwise. The constraints 
(1) guarantee that each client is connected to at least one facility, while the constraints (2) make sure 
that a client can only be connected to an open facility. The problem we consider is metric, since 
the values c,y are required to satisfy the triangle inequahty (i.e., yi,j,i',f : Cjj < Cif + c/,' + c,'^). 
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and uncapacitated, since an arbitrary number of clients can be connected to an open facility. Fur- 
thermore, for the sake of presentation we assume all /, ,c,y to be normalized such that the smallest 
non-zero value is 1 . The dual program will be used in the description of our approximation algo- 
rithm. Intuitively, a aj variable can be seen as the amount the client j is willing to pay for being 
connected to a facility. From the point of view of a facility /, aj is the sum of Cij (the amount j 
pays for a connection to /) and j8,j (the amount it pays for opening /). 

Our algorithm is executed in the COMQEST model, which was introduced by Peleg [PelOO] 
and is commonly used to model the execution of distributed algorithms on graphs: Algorithms 
are executed in synchronous send-receive-compute rounds. In a single round, each node sends a 
message to each of its neighbors in the graph. Note that the messages sent to each neighbor by 
a single node are not required to contain the same information. Once all nodes have sent their 
messages, they receive a single message from each of their neighbors. After all the messages have 
been received, every node is allowed to spend an arbitrary amount of time for computation (i.e., 
computation is for free and we are only interested in the number of communication rounds). The 
end of the computation by all nodes marks the start of a new send-receive-compute round. The 
message size in the COMQSSTmoAeX is bounded. We will limit the size of the messages used in 
our algorithm to 0{Vog{n)) bits. The same limitations were also used in [MW05], [GLS06] and 
[PP09]. This bound is reasonable, because it allows the nodes to send their ID in a single message. 
Due to this constraint on message size, we restrict the values fi and Cij in such a way that it is 
possible to represent them with C?(log(?i)) bits (i.e. to be able to send them in a single message). 

The graph our algorithm is executed on is the complete bipartite graph of clients and facilities. 
This means that within a single round each client can communicate with all facilities and each 
facility can communicate with all clients. Since nodes of the same partition can not communicate 
directly with each other, but instead have to use nodes of the other partition as relays, gathering all 
information about the graph requires Q.{n) rounds. Note that without a limit on the message size 
our problem could be trivially solved in four rounds (all information about the graph is gathered at 
a single node in two rounds, an optimal solution is computed and distributed in two more rounds). 
We want to stress that, although in our algorithm the nodes communicate with all their neighbors 
in each round, it is possible to restate the algorithm such that nodes / and j where cij is "large" 
(more than maXygc(niin,Gf(c,j •+/,))) never communicate with each other. 

Our contribution. We present a distributed approximation algorithm for the metric uncapaci- 
tated facility location problem. It is based on a greedy algorithm by Jain et al. [JMM+03] (from here 
on referred to as GreedyFL) and yields a guaranteed (not in expectation) (1.861 +£)-approximation 
in 0(«^/'*logi_|_g(«)) rounds (with high probability). Our algorithm is executed in the COMQEST 
model on a complete bipartite graph. Although it is strongly related to the GreedyFL algorithm, 
there are (due to the parallel execution) new challenges regarding the selection process that occurs 
when multiple facilities are eligible for opening. The difficulty is that we require the cost of the 
solution computed by our algorithm to be always at most (1.861 + £) times worse than an optimal 
solution (in contrast to similar work by Blelloch et al. [BTIO], where the approximation factor is 
increased by factor 2). Moreover, unlike other work (e.g., [PPIO], where the approximation factor 
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is given "in expectation"), our algorithm provides a worst-case guarantee on the approximation 
factor. 

Related work. During the last two decades, the uncapacitated metric facility location problem 
was of great interest, and a lot of progress has been made concerning the running time and ap- 
proximation factor of sequential algorithms solving it. Aardal et al. introduced the first polynomial 
time algorithm yielding a 3.16-approximation [ATS97]. Improving this approximation factor was 
the topic of a multitude of research papers. For example, Chudak et al. improved the approxi- 
mation factor to {\+2/e) 1.74 [CS99]. The factor has been improved, until eventually Byrka 
designed a 1 .5-approximation algorithm [ByrOV], which, at this point in time, yields the best known 
approximation. Although both last mentioned algorithms yield very good approximation factors, 
they have, due to applying LP-Rounding, high running times. Thus, the design of algorithms with 
slightly worse approximation factors, but better running times, was also of interest in the past. 

Jain and Vazirani used the primal-dual approach to develop and analyze an algorithm with ap- 
proximation factor 3 and a running time of 0{n^\og{n)) [JVOl]. A simplified and faster 0{n^) 
version of this algorithm was introduced by Mettu and Plaxton [MPOO]. Later on, Jain et al. im- 
proved the former results by presenting two algorithms in [JMM+03]: One with running time 
0(mlog(m)) and an approximation factor of 1.861 (which we refer to as GreedyFL) and another 
one with running time 0{rr') and an approximation factor of 1.61 {n denoting the number of nodes 
and m the number of edges in the complete bipartite graph of facilities and clients). Later on, 
building upon [JMM+03], Mahdian et al. improved the factor to 1.52 [MYZ02]. 

Under the assumption that MV % i:'7ZM<f(«'''s(iog(«)))^ Quha et al. showed in [GK99] that no 
polynomial time algorithm with an approximation factor better than 1.463 exists. Thus, an 
improvement of either the lower or the upper bound for the approximation factor, even in the 3rd 
or higher fractional digit, is of great interest. 

The following results concerning the facility location problem can be found in the distributed 
scenario: In [PP09] Pandit et al. present an algorithm yielding a 7-approximation and a running 
time of C?(log(?i)). Their algorithm is a parallel version of the primal-dual algorithm by Jain et al. 
[JVOl] and is - like our algorithm - executed in the COMQSSTmoAel on a complete bipartite graph 
with message size limited to 0{\og{n)) bits. Pandit et al. assume that the difference between the 
opening costs of the cheapest and the most expensive facility can be arbitrary large. It is reasonable 
to drop this assumption, since they (as well as we) require that the facility opening costs and dis- 
tances between clients and facilities are encoded with C?(log(?i)) bits. By dropping this assumption 
and modifying the algorithm and analysis by Pandit et al. in a small way (changing a factor from 2 
to (1 + £)), it is possible to achieve a (3 + e) approximation factor in 0(logi^g(?i)) rounds. Further 
improvement of the factor with this approach is not possible, since the approximation factor of 
[JVOl], which Pandit et al. parallelized, is 3. This means that their algorithm, while faster than 
ours, has a worse approximation factor. 

Recently, in [PPIO] Pandit et al. all presented a technique that can be used to execute greedy 
facility location algorithms (like GreedyFL) in parallel in polylogarithmic time. Although they 
consider a similar problem as we do, their results are quite different from ours: Their approximation 
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factor is 0(1) in expectation (their algorithm can produce an arbitrary bad solution, even though 
this is very unlikely), while we can guarantee an approximation factor of (1.861 + £) in the worst 
case. Also, although they do not state their exact approximation factor, the expected approximation 
factor achieved with their technique cannot, to the best of our knowledge, be decreased below 
4c, where c is the approximation factor of the sequential algorithm (i.e., w 7.444 if used with 
Greedy FL). 

A similar result to [PPIO] was presented by Blelloch et al. in [BTIO]. Instead of the CONGEST 
they use a PRAM model and achieve a (3.722 + e) approximation with running time 0(logj_^g(?i)) 
by parallelizing the GreedyFL algorithm. In order to choose which facilities to open, they use a 
technique introduced by Rajagopalan and Vazirani [RV98]. This technique can also be used to 
execute greedy facility location algorithms (in a distributed manner and polylogarithmic time) and 
yields a guaranteed approximation factor of 2c, where c is again the approximation factor of the 
corresponding sequential algorithm (i.e. 3.722 if used with GreedyFL). 

Other results that also use the bound of 0(log(?i)) on the message complexity are [MW05] and 
[GLS06]. In [MW05] Moscibroda et al. show that in 0{k^) communication rounds a 0{k{nF pY /^log{nF + 
nc)) approximation in 0{k^) communication rounds {rif and «c are the number of facilities, re- 
spectably clients, and p a coefficient dependent on instance parameters) can be achieved in the 
more general non-metric facility location problem. Gehweiler et al. show in [GLS06] that 3 rounds 
are sufficient to compute a C?(l)-approximation for the uniform facility location problem (opening 
each facility incurs the same costs). [GLS06] applies the approach of Mettu and Plaxton [MPOO], 
which has been successful in a lot of other settings as well: In the kinetic setting [DGLIO], in game 
theoretic settings [PT03], for algorithms working in sublinear time [BCIS05], and when confronted 
with perpetual changes to the problem instance [DKPIO]. 

Structure of the Paper. In Section 2 we give a detailed description of our approximation algo- 
rithm and prove its approximation factor by generalizing the techniques and results of [JMM+03]. 
Our main contribution is Section 3. Here, we deal with the problem of selecting facilities to be 
opened that arises due to the parallel execution of the GreedyFL algorithm by Jain et al. and show 
that our distributed algorithm presented in Section 2 terminates in sublinear time. 

2. The distributed Approximation Algorithm 

Our parallel algorithm is based on the GreedyFL algorithm by Jain et al. [JMM"'"03]. As stated 
above, it is executed in the COMQ8STmoA&\ on a complete bipartite graph. We will first describe 
the algorithm and then give an analysis of its approximation factor and running time. 

Algorithm description. Due to the completeness of the bipartite graph, in a single synchronous 
round each client can send to and receive a message from all facilities. The same applies analo- 
gously to each of the facilities. During the algorithm's execution, clients and facilities can be in 
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various states: clients can be not connected or connected, while facilities can be closed, currently- 
paid, or open. Intuitively, currently-paid facilities compete with each other for the permission to 
change their state to open. All clients start in the not connected state and all facilities in the closed 
state. When a client j changes its state to connected, it is assigned to an open facility / := i) 
and never changes its state again. A closed facility can only change its state to currently-paid, 
while a currently-paid facility can either become closed or open. Once a facility is open, it never 
changes its state. The algorithm terminates, once all clients are connected. The solution retumed 
by our algorithm is the set of all open facilities and the assignment <p. Each facility knows now 
whether it is open or not and each client knows the facility it is assigned to. 

Algorithm 1 Executed by client j 
v.yieF: SEND[aj,status{j)] 

2: Execute Algorithm 4 until there are no currently-paid faciUties left 
3: if status{j) = not connected then 
4: Compute Uj := {\ -\- e)aj 



The algorithm is executed in a loop and each passing of the loop is referred to as a phase. The 
behavior of a client during a phase is described in Algorithm 1, while the behavior of a facility is 
described in Algorithm 2. Facilities and clients interact with each other by transmitting messages; 
to each Send operation there exists a corresponding Receive operation. 

The idea of the algorithm is to assign to each client j a variable aj initialized with 1. This 
variable represents the amount a client is willing to pay for being connected to a faciUty and for 
opening this facility in the current phase (i.e., the ttj variable of the dual program). Each not 
connected client j increases this variable in every phase by multiplying it with (1 + e) where e is 
an arbitrary small constant greater than (fixed at the start of our algorithm). After aj is increased, 
all clients send their current aj to all facilities. 

Algorithm 2 Executed by facility / 
"ijeC: Receive [o;j,5';a;M5'(;)] 
2: if (status{i) = closed) then 

U := {j\j € C f\status{j) = not connected} 
4: Compute coveredCost := J^jeu niax(0, aj — cij) 

if (fi < coveredCost) then 
6: status(i) := currently-paid 

Execute Algorithm 3 until there are no currently-paid faciUties left 



Let U be the set of all not connected clients and ^ij := max(a^- — c,-j,0). We say that client j 
contributes to faciUty i, if j3,y > (the cUent pays cij for being connected to and j3,y for the opening 
of facility /). Upon receiving the aj variables, every closed facility / computes Y^jeu Pij' which 
represents the amount not connected clients are willing to contribute to the payment of opening 
facility /. If the contribution to the opening of facility / (i.e. J^jeu Pij) reaches a point such that 
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fi < T^jeu ^ij^ it changes its status to currently-paid. 

Algorithm 3 Executed by facility / 

Ri := uniformly distributed random number in [0, 1] 
2: yjeC: SmD[Ri,status{i)] 

\/j£C: Receiye[S j,status{j)] 
4: Define T; := maxj{Sj) where / := {j\j G C,status{j) = not connected, Uj > Cjj} 

thisRoundOpenedi := false 
6: if (7} = Ri A status{i) = currently-paid) then 

status{i) := open 
8: thisRoundOpenedi := true 

\/j & C : SEND[status{i), thisRoundOpenedi] 
10: yjeC: Receive [5?a?M5(;)] 

/' := {j\j £ C,status{j) = not connected, aj > c,y} 
12: if {status{i) = currently-paid f\Y,jei'™'^{^i (^j ~ ^ij) < fi) then 
status{i) = closed 



Opening all the currently-paid facilities could result in a very bad approximation, since it is 
possible that a single not connected client j is contributing to an arbitrary number of currently- 
paid facihties (i.e. I5ij,l5iij > 0,/ / /') and opening just one of these facihties would be sufficient. 
Thus, a selection procedure is needed to determine which currently-paid facihties are going to 
be permanently opened (change their status to open) and which of them will remain closed in this 
phase (changing their status from currently-paid to closed). Furthermore, it is required to determine 
which client is connected to which facility. All of this is achieved by the interaction of Algorithm 
3 (from the point of view of facility /) and Algorithm 4 (from the point of view of client j). In each 
iteration of Algorithm 3 and 4, a subset of currently-paid facihties is chosen to be opened. The 
development and analysis of these two algorithms is the main contribution of our paper, as they are 
responsible for selecting the facilities (which are going to be opened) in a fast and efficient way. 
The algorithms operate by generating a random number at each facility and then opening a facility, 
if its number is the highest one in its neighborhood (the neighborhood of a facihty i consists of all 
facilities such that there is at least one client that contributes to / and /')■ Only 0(log(?i)) bits 
are used to represent the generated random number, which is sufficient to guarantee that, with high 
probabihty, no two facilities generate the same number. The main objective of Algorithm 3 and 4 
is to estabhsh the following fact: 

Fact 1. In every phase, after the iterative execution of Algorithm 3 and 4 terminates, the following 
is true for all facilities: 

(i) No facility is currently-paid. 

(ii) If facility i is open, then Ljet/' Ay ^ fi> where U' = {j\(p{j) = i}. 

(Hi) If facility i is closed, then T^jeu Pij < fi' where U = {j\state{j) = not connected}. 
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Intuitively, (ii) means that a facility is only opened, if all clients connected to it can (together) pay 
for its opening, and (iii) means that, after a phase ended, there is no closed facility to which clients 
contribute enough to pay for its opening. The fact that a closed facility with Y^jeu Pij ^ cannot 
enter the next phase, as implied by (iii), is of importance. It ensures that the desired approximation 
factor can be guaranteed, which is curcial for (the later introduced) Lemma 2. 

Algorithm 4 Executed by client j 
hVieF: RECEiyE[Ri,status{i)] 

2: Define Sj := maxy(/?,) where J := {i\i G F,status{i) = currently-paid, aj > Cjj} 

3: "iieF : Send[S j,st at us {j)] 

4: V/ € F : RECEiyE[status(i),thisRoundOpenedi] 

5: f := £ F,status(i) = open, aj > Cij ,thisRoundOpenedi = true} 

6: if {status{j) = not connected A J' ^ 0) then 

7: status{j) = connectedlThere is exactly 1 element in /, see Algorithm 3} 

8: (p{j) = i {i G /) 
9:V/GF: SEND[status{j)] 



Note that after the algorithm's termination the cost of the solution computed by our algorithm 
lies in the interval [Y^jec^j/i^ + ^)jlljec^j]- This is due to the following facts: Each client is 
connected to exactly one facihty and this facility is open. Since j3,y = max{aj — c,y,0) a client j 
contributes to the opening of a facility / only after aj covered the costs c,y of connecting / to j. 
Also, since we require that only after fi < Y^jeu Pij facility i changes its state to open, the opening 
of / is completely paid for and since U is the set of all not connected clients, it is made sure that a 
client contributes to the opening of at most one facility. 

The running time of the entire algorithm is bounded by 0(«^/^logf_,_g(«)) rounds: The number 
of phases is 0(logi_,_g(n)), since in the worst case a client has distance 0{poly{n)) to its nearest 
facility and this facility has opening costs of 0(poly{n)) (the and /,• are limited by C?(log(«)) 
bits). Thus, increasing the aj > 1 value iteratively by multiplying it with (1 + £), where e > 0, 
results in 0{\ogl_^_^{n)) rounds. Later, we will show that with high probabiUty Algorithm 3 and 4 
terminate after 0(n^/'^log(n)) rounds. The value chosen for e not only determines the runtime, but 
also the approximation factor, which is 1.861(1 + e)^. 

Analysis of the approximation factor and runtime. Intuitively, the GreedyFL algorithm 
can be seen as continuously and simultaneously increasing the aj values. This allows GreedyFL 
to open a facility i in the exact moment it is paid for (i.e., when Y.jeu ^ij = fi)'^ a property which 
is essential for proving its approximation factor In contrast, our algorithm increases the ay values 
in discrete steps and thus, in general, it may occur that Y^jeu ^ij ^ fi- The goal of this section is 
to show that - even though our algorithm may yield a very different solution from the sequential 
algorithm - the costs of the optimal and our solution do not differ too much. Namely, we show 
that our approximation factor is at most (1 +e)^ times greater than the approximation factor of 
GreedyFL. Our proof uses similar arguments as, and is based on, the one of Jain et al., but it turns 
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out that in order to apply the results of Jain et al., we need to get a deeper insight into the structural 
properties of the linear programs involved. 

The aj and jS,y values computed by our algorithm can be interpreted as a solution to the dual 
program of the facility location program presented in the introduction. However, they do not nec- 
essarily represent a feasible solution: our algorithm opens a facility when Y^jeu l^ij ^ fi (summing 
over all not connected clients) and not, as required by constraint (4), over all the clients. If we 
choose 7 such that := and j8;*- := a*j — c/j constitute a feasible dual solution, the Duality The- 
orem yields that our solution is by a factor of at most 7 more expensive than an optimal solution. 
For their original, sequential algorithm Jain et al. gave an upper bound of 1.861 for 7 [JMM+03]. 
They denoted the used technique as dual fitting with a factor-revealing LP. We will construct such a 
factor revealing program (see FacRev-Dist) for our algorithm and establish a structural connection 
between it and the LP by Jain et al. (see FacRev-Jain) for their sequential algorithm {GreedyFL). 

To this end, consider a single facility / and assume, that w.l.o.g. aj > ycjj holds only for the 
first k clients and that a\ < a2 < ■ ■ ■ < (Xk- The next two lemmas express the constraints imposed 
by both the algorithm and the facility location problem. They resemble similar lemmas to those 
used in the proof for the sequential algorithm (cf. [JMM+03]) (their proofs can be found in the 
appendix). 



Lemma 1. For any two clients j, f and a facility i, we have ^ GCf + + c, 



'J- 

Lemma 2. For every client j and facility i, the execution of the algorithm guarantees that Y4=j max(ay - 
(l+£)c,7,0) <(!+£)/;•. 

Our algorithm ensures that the constraints stated by the two lemmas above are met. Thus, we now 
want to find the minimum value for 7 such that Lj=i ~ c,y,0) < /,■ holds, without violating 

the lemmas' statements. Equivalently, we want to maximize the ratio (Ly=i + L7=i 

(again, without violating the constraints). This formulation yields the following family of linear^ 
programs, referred to as the factor-revealing LP. 

FacRev-Jain: 

maximize Zk = . , U 

Ji+Lj=l <^'ij 

subjectto aj < aj+i Vj € {1, . . . 1} (1*) 

aj < ai + dj + ai Vj,/ g {1,...,^} (2*) 

i:t^.max(ay-Q/,0) < f Vj G {1, . . . (3*) 

aj.CijJi > V7e{l,...,^} (4*) 



Note that this formulation is not a linear program, but can be transformed easily into one. 
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FacRev-Dist: 

maximize Zk = r'k 

Ji+Lj=i Cij 

subject to aj < aj+i \/j e{l,...,k-l} (1**) 

aj < {l + e){ai + cij + cu) \fj,le{l,...,k} (2**) 

lt,-max(a,--(l + e)ctf,0) < {l + e)fi Vj G {1, . . . (3**) 

ccj,cij,fi > \/je{l,...,k} (4**) 

Note that constraints (1*) resp. (1**) are only used to sort the aj values in order to ease the for- 
mulation of constraints (3*) resp. (3**). The following theorem quantifies the correlation between 
our (FacRev-Dist) and the original factor-revealing LPs (FacRev-Jain). 

Theorem 1. For any k £N, the optimal solution to FacRev-Dist is bounded by (1 -|- e)^1.861, and 

thus the approximation factor of our algorithm is also (1 + e)^1.861. 

Proof. Jain et al. showed that Zk can be at most 1.861 in FacRev-Jain. In our modified version Zk 
can be at most (1 + £)^1.861: Fix a problem instance by setting the Cij variables and the f variable 
to arbitrary values > and consider the following two propositions. 

Proposition 1. In an optimal solution a := (ai ,a2,..., CCk) to FacRev-Jain, eC at least one of 
the two properties is true: 

(i) Uj = if + ZqeQj Ciq)/\Qj\, where Qj := {q\q G C A ttj - Ciq > A a^- < a^} 

(ii) 31 €C:ai = {fi + LqeQi(^iq)/\Ql\ '^nd aj = ai+Cji + cu 

Proof. Note that the term (/,• + Y^qeQj Ciq) /\Qj\ is fixed. Since we deal with a maximization prob- 
lem, which is bounded (see constraint (3*) of FacRev-Jain), each aj is bounded. Since a represents 
an optimal solution, for each Uj there must be at least one constraint that bounds Uj and is also 
tight. If this tight constraint is of the form Lf=ymax(a, — c,7,0) < f, (i) holds. 

Otherwise, there must be at least one tight constraint of the form aj < Ol/t, +c;y + c/i| , i.e. 3ki G 
C : aj = + -I- c,ytj . This argument can be analogously applied to ki : either = {f -\- 
'LqeQi, Ciq)/\Qki\, there exists a k2 such that a^i = 0Ck2 +c/i2- Like this, we can build 
a recursion for aj = Uk^ + Cij + cik^ , which terminates as soon as we reach a variable a^, with 
CCk, = if + Hqe Qt, )/\Qk, \ and thus (i) holds for kt . 

Note that for every aj there must be such a terminating sequence, since constraint (3*) is the 
only constraint which gives an absolute upper bound on the variables. Otherwise, the aj would 
be unbounded. The recursion yields Uj = Uk, + Cik^ + Ciki + Cik^ + Cik^ + ■■■+ Cik,_i -\- Cik, , implying 
cij > Uk, + Cij -I- Cik, ■ Since also aj < Uk, + Cij -\- Cik, (consfi-aint (2* )), we have aj = Uk, + c,y -h Ci^, , 
proving (ii). □ 

Proposition 2. Given an optimal solution a := (ai , 052, • • • , cCyt) to FacRev-Jain and an optimal 
solution a' := (aj , 012) • • • ) o;^ FacRev-Dist, we have V; G C : < (1 + e)'^aj. 
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Proof. Since the constraints in FacRev-Dist are a relaxation of the constraints of FacRev-Jain, the 
solution a is also a feasible solution of FacRev-Dist. Based on Proposition 1 we know that to any 
aj at least one of the following cases appUes: 

(i) ay = {fi + Y.qeQj'^iq)/\Qi\ i-^- the constraint ^^^^ max ( ay — c,7,0) < fj is tight. Notice that 
for a'j satisfying the constraint L5^^ymax(ay — (1 + £)c,7,0) < (1 + £)/;• we have, due to the 
set {c,7|ay > Cii] containing the set {cii\a'j > (1 + e)c,7}, aj < (1 + £)ay. 

(ii) According to Proposition 1 we know that 3/ € C : a/ = {fi + 'LqeQi ^iql\QA ^'^^ = Of/ + 
Cji + Cii. Constraint (2) of FR-Dist guarantees that aj < (1 + £)(a/ + cy, +0,7). Since we can 
apply case (i) for /, we have a/ < (1 + £)a/, which yields a'j < (1 + £)((1 + £)a/ + cy, + c,7) < 

(l+£)2(a/+Cy,- + C,7). 

□ 

Given any instance of the facility location problem, we know that the optimal solution to the 
FacRev-Jain LP is bounded by 1.861 and that, by Proposition 2, the solution to FacRev-Dist for 
the same instance is at most by a factor of (1 + £)^ larger. Since the problem instance was chosen 
arbitrarily, the theorem's statement follows. □ 

3. Facility Selection Process 

In this section we are only concerned with a single phase of our algorithm (i.e. the loop in which 
Algorithm 3 and 4 are invoked and which establishes Fact 1). We show that it terminates after 
0(?i^/^log(«)) rounds with high probability. 

To simplify notation, F and C do not refer to all facilities resp. all clients (as was the case before). 
Instead, let F represent the set of all facilities that are in the currently-paid state and C denote the 
set of all not connected clients contributing to a currently-paid facility. All the other clients and 
facilities effectively do not take part in the execution of Algorithm 3 and 4 in the considered phase 
and thus can be ignored. We consider the bipartite Graph G = {FUC,E). There is an edge in E 
between / G F and j € C if and only if client j contributes to / (i.e. ay — c,y > 0). Furthermore, we 
assume that there are no isolated nodes (i.e. deg(v) > 1 for all v £ FUC). Let « = |F| + |C|. By 
Gp '■= {F,Ef) we denote the Facility Graph, where there is an edge {/,/'} between two facilities 
/, /' S F if and only if they share a common client in G. We use deg(-) and A'^(-) to denote the degree 
and neighborhood of a node in G respectively. If the neighborhood includes the node itself, we 
write A'^^(-). Similarly, we use deg^ (•), Nf{-), and Np{-) to denote the corresponding properties in 
Gp. 

In each execution of Algorithm 3 and 4 the set F of currently-paid facilities shrinks (they change 
their status to closed or open). Also, with every facility changing its state from currently-paid to 
open clients that are connected to those newly opened facilities are removed from C. Removing 
these clients form C can potentially cause currently-paid facilities to change their state to closed 
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(they lose contributing clients and their opening costs are not paid for anymore). Since it cannot be 
guaranteed that a currently-paid facility which loses a contributing client is not longer fully paid 
for, applying a distributed MIS (maximal independent set) algorithm (e.g. Luby's MIS algorithm 
[Lub85]) on Gp does not help to solve our problem: A facility that loses a contributing client (i.e. 
a facility that is not part of the MIS) might still be fully paid for If such a facility is not opened 
in the current phase (i.e., (iii) of Fact 1 does not hold), clients contributing to it wiU raise their aj 
values in the next phase, which in turn could violate the constraints of the FacRev-Dist LP. Since 
shrinking F by removing currently-paid facilities that are not paid for only decreases the runtime of 
Algorithm 3 and 4, we will assume (in order to consider the worst case) that facilities never change 
their status from currently-paid to closed as the result of chents changing their state to connected 
(i.e., losing contributing clients). 

A simplified probiem description. In order to alleviate describing the shrinking process of 
the set of currently -paid facilities, we simplify the analysis of Algorithm 3 and 4 by omitting the 
information about the values of c,y and fi and simulating the distributed execution of the algorithms 
by the algorithm FacilitySelect. Its input is the bipartite graph G and it possesses global 
knowledge. 

Algorithm 5 FacilitySelect 
\: while (/^ 7^0) do 

2: / := {Subsequently, determine I QF such that {i, i'} ^ Ep for all i, i' G 1)) 
3: for all / G F : do 

4: Generate a uniformly distributed random value r,- G [0, 1] 

5 : if (r; > max;/ r,/ ) then 

6: Add / to / 

7: Ehminate A'^+(/) from G 



As the execution of FacilitySelect progresses, F , C, E, and n change. We denote these sets 
and n in round t as Ft, Q, Ef and However, most of the time we consider only the effect of 
a single iteration of the while loop on the graph. In these cases, we omit the time parameter and 
simply use F ,C,E, and n to refer to the corresponding values in the considered iteration. 

Time required to Slirinl< thie bipartite grapll. This paragraph analyzes the time required to 
remove all nodes from G. It is easy to see that a fixed facility / G f is selected with probability 
1 / (deg^(/) + 1) by this strategy. First, we show that, in expectation, FacilitySelect removes at 
least l^l / |F| clients and at least max{|F| , l^j / |F|} edges in a single iteration of the while loop. 

Lemma 3. The expected number of clients removed by FacilitySelect in a single iteration of 
the while loop is at least \E \ /\F\. 
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Proof. For a facility i ^ F , let X, denote a random variable counting the number of clients j £ C 
removed because of / being added to / or not. That is, 

^_|deg(/) if/G/ 

lO otherwise . 

Since / is an independent set in Gp, the total number of clients removed in one iteration can 
be described by the random variable X := L/gf^(- Using the inequality deg^ (/) + 1 < \F\, we 
compute: 

□ 

Lemma 4. The expected number of edges removed by FacilitySelect in a single iteration of 
the while loop is at least max{|f | , l^l / \F\}. 

Proof. First note that, since by Lemma 3 the expected number of clients eliminated is at least 
l^l / |F| and there are no isolated clients, the expected number of edges removed is at least j^l / 
It remains to show that the expected number of edges removed is at least For a facility / G F, 
let us define random variables Z, counting the number of edges removed because of / being added 
to / or not. That is, 

lO otherwise . 

Since / is an independent set in Gf, the total number of edges removed by strategy FacilityS- 
elect can be described by the random variable Z defined as Z := Y^iep'^i- Using the inequality 
degp(/) + 1 < ZjeN{i) deg(j). we compute: 

□ 

As an corollary, we get that, in expectation, the number of edges is reduced by at least its square 
root: 

Corollary 1. The expected number of edges removed by FacilitySelect in a single iteration of 
the while loop is at least \/\E\. 

Proof. Lemma 4 and the equality jFj = |F| • imply max{|F| , |^} > ^ 

Lemma 5. If the number of edges is greater or equal n^/n, the expected number of clients removed 
by FacilitySelect in one iteration of the while loop is at least \/\C\. If it is smaller or equal 
n^/n, the expected number of removed edges is at least \fn^Jn. 
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Proof. For the first statement, we can bound the expected number of removed clients with the help 
of Lemma 3 as follows: 

\F\ \F\ n 

The second statement follows immediately from Corollary 1 . □ 

We call a round t edge-heavy, if in this round the number of edges is at least nt^/n. Otherwise, 
we refer to it as edge-light. 

Lemma 6. For an arbitrary round t, consider the effect of several iterations of FacILITYSelecT 's 
while loop starting at round t. 

(a) After \\/nt (not necessarily consecutive) edge-heavy rounds, at least \Ct\ clients are re- 
moved in expectation. 

(b) After ^y^Wry^ (not necessarily consecutive) edge-light rounds, at least :^^\Et\ edges are 
removed in expectation. 

Proof. We prove only the first statement, since the second can be proven analogously. Let Y denote 
a random variable counting the number of clients removed during edge-heavy rounds. If 



2' 

Y < \Ct\ /2, the number of clients in each of the considered rounds was at least |Q| /2. Thus, by 
Lemma 3, in each of these \^/r^ rounds at least ^y]Q\J2 chents were removed in expectation. If 
J' > IQI /2, the number of removed clients is trivially lower bounded by \Ct \ /2. We get: 



□ 

Lemma 7. For an arbitrary round t, consider the effect of several iterations of FacilitySelect 's 
while loop starting at round t. 

(a) The probability of removing at least |Q| clients after ^ edge-heavy rounds is at least a 
constant > 0. 

(b) The probability of removing at least \Et \ edges after ^ ^Jr^'^nt edge-light rounds is at least 
a constant > 0. 

Proof. We prove only the first statement. The second can be proven analogously. 

By Lemma 6 |Q| clients are removed after edge-heavy rounds in expectation. Let X 

define a random variable counting the number of clients removed after \^fnt edge-heavy rounds. 
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Now define to be the probability that more than \ Ct \ clients are removed. We can bound ^[X] 
in the following way: 

^ic,i<£m<,„+(i-M)^ic,i 

Solving this inequality for /i yields /i > c, where c is a positive constant. □ 
We are now ready to prove our main theorem. 

3 

Theorem 2. Algorithm FacilitySelect terminates with high probability after 0{m log(?i)) it- 
erations of the while loop. 

Proof. By Lemma 7, starting at iteration t we lose at least a constant fraction of either the clients 
or the edges with a non-zero, constant probability after at most 2v^+ ^yJnt^/rTt < nj^ rounds. 
Partition the number of iterations done by FacilitySelect into phases of n^l'^ rounds. Let Xi 
denote a binary random variable indicating whether the /-th phase was good (we lost at least the 
above mentioned constant fraction of either clients or edges). We need about 21og?i good rounds 
to ensure that the problem size has become 0(1). Since we have Pr{Xi = 1) > c for some positive 

constant c, we get for a := 2l.i={ ^i- 

E[X] > 21og(?i) . 

That is, in expectation it takes about - • 21og(?i) phases until FacilitySelect terminates. Apply- 
ing a standard Chernoff bound yields: 



Pr(^\X-E[X]\>^^ <2-4i^W/3 



In other words, FacilitySelect terminates with high probability after at most ^21og(?i) steps. 

□ 



4. Conclusion and Outlook 

We presented a parallel execution of a greedy sequential algorithm for the facility location problem 
and showed that we can preserve its approximation factor. However, there are other sequential 
facility location algorithms based on the greedy approach introduced by Jain et al. that yield (better) 
approximation factors of 1.61 [JMM+03] and 1.52 [MYZ02]. It might be possible to execute them 
in the COMQSST model with a sublinear running time using the technique we introduced in this 
paper. 

When our algorithm chooses which facilities should be opened, it bases its decision solely on the 
information provided by the bipartite graph G used in Section 3 and all other information (client 
contribution, distance, facility opening costs ect.) is abstracted away (Section 3 provides reasons 
for this abstraction). At this level of abstraction the lower bound of solving the problem is y/n: 
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consider a bipartite graph with facilities with degree of ^Jn— 1, and ( clients with degree 
2. Edges are chosen in such a way that any two facilities are in hop distance of 2 of each other. It 
would be interesting to know in what way the entire available information can be used to improve 
the runtime and whether a poly-logarithmic runtime is possible with the same approximation factor. 
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A. Proofs for Lemma 1 and 2 



Proof of Lemma 1. If (Xj/{ \ + £) < oty/ the lemma obviously holds. Thus, let us assume the op- 
posite. Let t be the phase when our algorithm connects client / to some facility /' and s the 
phase /' was opened. Facility /' was either opened before / was connected to it {s < t) or opened 
exactly in the same phase client / was connected to it {s = t). Since aj > we know that 
client j was connected in phase u > t + I. At time s < t the contribution of client j was not 
enough to cover the distance between j and since otherwise j would be connected in phase s 
with This means that once the contribution of j reaches Ci'j, client j will be connected to 
Since the contribution is increased by multiplying the current a value with (1 +£), it can not be- 
come greater than (1 +e)c,/j, hence aj < (1 + £)c,'y. Thanks to the triangle inequality we have 

aj/{l +e) < Cfj < Ciiji + df + Cij < ay + C,y + C;y. □ 

Proof of Lemma 2. To prove by contradiction, we assume that the inequality does not hold. Thus, 
we have Lf=ymax(ay — (1 + £)c;7,0) > (1 + £)fi- Since we ordered the clients according to their 
a value, we know that for 1 > j we have «/ > ay. Let t be the phase client j was connected to a 
facility. By the assumption facility / is fully paid for (i.e. /'s current payment is > /,) in phase s <t. 
There must be at least one client /, with j <l < k and ay — (1 + £)c,7 > which was connected to 
/ in a phase q < t. For this client /, we have a/ < ay, since the algorithm will stop increasing a/ 
when I is connected to a fully paid facility (see Fact 1). □ 
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